METHODOLOGICAL ARTICLE 



Advances in Cognitive Psychology 



Confidence intervals 
for two sample means: 
Calculation, interpretation, 
and a few simple rules 

Roland Pfister and Markus Janczyk 

Department of Psychology III, Julius Maximilians University of Wurzburg, Germany 



ABSTRACT 



KEYWORDS 

confidence intervals, 
graphical data presentation, 
repeated measures, 
within-subjects designs, 
between-subjects designs 



Valued by statisticians, enforced by editors, and confused by many authors, standard errors (SEs) 
and confidence intervals (C/s) remain a controversial issue in the psychological literature. This is 
especially true for the proper use of C/s for within-subjects designs, even though several recent 
publications elaborated on possible solutions for this case. The present paper presents a short and 
straightforward introduction to the basic principles of CI construction, in an attempt to encourage 
students and researchers in cognitive psychology to use C/s in their reports and presentations. 
Focusing on a simple but prevalent case of statistical inference, the comparison of two sample 
means, we describe possible C/s for between- and within-subjects designs. In addition, we give 
hands-on examples of how to compute these C/s and discuss their relation to classical f-tests. 



If you take care of the means, the end will take care of itself 

Mahatma Ghandi 

STATE OF AFFAIRS 

Recent developments in psychological research methods converge on 
the notion that each mean is best accompanied by an appropriate con- 
fidence interval (CI) and, consequently, CIs are discussed in many con- 
temporary statistical textbooks (e.g., Bortz & Schuster, 2010; Howell, 
2012). Interestingly, this mainly holds true for between-subjects designs 
for which CJs are relatively easy to compute (Cumming & Finch, 2005). 
In contrast, standard errors (SEs) and CJs for within-subjects designs 
are still mysterious for many researchers (cf. Belia, Fiedler, Williams, & 
Cumming, 2005) even though several excellent publications elaborated 
on appropriate CJs for this situation during the last decades (Loftus & 
Masson, 1994; see also Cousineau, 2005; Estes, 1997; Franz & Loftus, 
2012). 

Most of these approaches to CJs for within-subjects desgins, how- 
ever, are rather difficult to understand because they rely on relatively 
advanced measures such as the error term of the repeated-measures 
Analysis of Variance (ANOVA). So, while potentially applicable to 



a wide range of studies, CJs for within-subjects designs are widely 
misunderstood (Belia et al., 2005) and rather complicated to calculate 
(Bakeman & McArthur, 1996). This dilemma is especially relevant for 
cognitive psychologists who tend to rely heavily on within-subjects 
designs (e.g., as compared to researchers in personality or social psy- 
chology; cf. Erlebacher, 1977; Keren, 1993). 

Nonetheless, reporting appropriate CJs has become an essential 
component of American Psychological Association (APA) style (APA, 
2010; Wilkinson & the Task Force on Statistical Inference, 1999), and 
nowadays many journal editors encourage authors to add CJs or SEs 
when reporting their data. Further, as teachers we often push students 
to aid their data presentation with CJs, be it in-class data presentation 
or for presentations at scientific meetings. Yet, an easy, tutorial-like 
explanation on how to choose and calculate these CJs is missing. 
To fill this gap, we focus on a simple and often applied statistical 
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analysis - the comparison of two means from independent groups as 
well as from dependent groups/conditions - and describe appropriate 
CIs in an intuitive framework. In this framework, we suggest that us- 
ing a much simpler approach to within-subjects CIs than suggested 
in often-referenced papers (e.g., Loftus & Masson, 1994) is preferable 
in most cases. In particular, this approach simply relies on the CI of 
the difference between sample means (see Franz & Loftus, 2012, for a 
more detailed discussion of this approach and its advantages over other 
approaches). Such CIs are more initimately related to their between - 
subjects counterparts, are easily obtained with any computer program, 
and allow for a straightforward interpretation. 

In the following section, we outline how different CIs can be 
computed for the common situation of comparing two sample means 
(cf. Table 1). These guidelines are intended to simplify graphical data 
presentation in a unifying framework that is intimately related to the 
different f-tests in classical hypothesis testing. 

A ens A CI IS A CI 

Independent of the underlying design, any CI for a sample mean can 
be broken down to a simple formula that only includes the mean itself, 
an appropriate SE, and a coefficient that is derived from the t- distribu- 
tion. In the following, we will use 95% CIs (i.e., a = .05) in all examples 
because of their widespread use in the literature (cf. Equation 1): 

95% CI = M±SE-t df ^ (1) 

All CIs computed with this formula rely on the same assumptions 
as t- tests in classical hypothesis testing do. More precisely, they assume 



a normally distributed variable with an unknown population variance 
that is estimated from the sample (thus implying measurement at the 
interval scale). Furthermore, these CIs are inherently two-tailed, as 
reflected by the use of a/2 to determine the coefficient. 

Most importantly, particular CIs differ in how the corresponding 
SE is computed, and the appropriate formula depends on two factors: 
(1) the experimental design and (2) the intended meaning of the CI. 
We start by discussing two CIs for between-subject designs before 
continuing with within-subjects designs. Following these theoretical 
points, we demonstrate how to compute the three CIs for an examplary 
data set. 

Between-subjects designs: 
Two independent samples 

For between-subjects designs, two distinct CIs can be computed that 
differ in meaning and interpretation. At first sight, the most straight- 
forward way might be to compute separate CIs for each individual 
mean M by simply using the corresponding SE. In fact, this is a valid 
solution and we will denote the resulting SE as SE M . Following from 
the central limit theorem, SE M is computed by dividing the unbiased 
estimator of the standard deviation (s) by the square root of the sample 
size n (see Equation 2): 

SE M -j= ^s = ^ = ^.±( Xi -M x f 0) 

The corresponding CI for individual means is denoted as CI M 
(cf. Equation 3): 

95% CI M =M±SE M -t n _ tom5 (3) 



TABLE 1. 

Fundamental Concepts for the Graphical Data Presentation of Two Means and the Associated Confidence Intervals 



Parameter 



Standard error (SE) 



A parameter is a fixed, but unknown population value. Sample statistics are used to estimate 
parameters. 

Measure for the standard deviation of a parameter estimator. In case of a sample mean, it is 
equal to the estimated standard deviation divided by the square root of the underlying sample 



Confidence interval (CI) 



Confidence interval for 
an individual mean (CI M ) 

Confidence interval for 
the difference between two 
means from independent 
samples (CJ D ) 

Confidence interval for the 
paired difference between two 
means (CI ) 



An estimate for plausible population parameters. Several different CIs can be constructed 
for the comparison of two means, depending on the employed design and the desired 
interpretation. Still, each CI can be broken down to the simple formula: "Mean ± Standard 
Error x Coefficient" (CI = M±SEx t l a/2 ). 

This CI is constructed from the standard error of the mean (SE U ) and can be used to compare 
this mean to any fixed parameter. It corresponds to a one-sample f-test and does not yield any 
precise information about the difference between two sample means. 

This CI is constructed from the between-subjects standard error of the difference between 
two means (SE D ). It thus corresponds to a f-test for independent samples and can be used for 
inferences about the difference between both means. 

This CI is constructed from the standard error of the difference between two dependent sample 
means (paired differences). It is thus applicable for within-subjects designs and equivalent to 
a paired -samples f-test. 
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This confidence interval can be used for inferring whether the mean 
is different from any fixed, hypothesized value (e.g., zero). Thus, the 
CI M corresponds to the one-sample f-test. The CI M , however, does not 
yield any precise information about the difference between its mean 
and any other sample mean. To obtain this information, one needs to 
compute a different SE that captures the (between-subjects) variability 
of the difference between both means. This measure - SE - is com- 
posed of both sample sizes (n x , n 2 ) and estimated standard deviations 
(s 1 and s 2 , respectively; see Equation 4): 



se J(n,-lK + ( n2 -1)s 2 2 £T + _1 



(4) 



the means of two paired samples, however, a straightforward and 
elegant solution seems to be more closely related to meaning and 
interpretation of CIs for between-subjects designs (see also Franz & 
Loftus, 2012). This solution simply uses the standard error of the paired 
differences (SE m ) to construct the CI. Accordingly, it does not require 
any ANOVA statistics, but can be computed easily from the standard 
deviation of individual difference scores s d (see Equation 6): 



SE PI 



3_ 

4n 



(6) 



The corresponding CI is labelled CI pD (following Franz & Loftus, 
2012; cf. Equation 7): 



The corresponding CI for the difference between two independent 
means is denoted as CI D (cf. Equation 5): 



95%C/ PD =M i;2 ±SE PD -f„_ 1; 0 . 975 



(7) 



95% Cl n =M,„±SE n t„ 



+n 7 -2; 0.975 



(5) 



In addition to the general assumptions mentioned above, the CI D 
assumes that the standard deviations are estimated from independent 
samples and that the size of these standard deviations is comparable 
(i.e., we assume homogeneity of variance) 1 . Importantly, conclusions 
based on the CI D are valid only for the difference between the means, 
and the CI D thus corresponds to the f-test for two independent samples. 
If centered around one of the means this test is significant if, and only 
if, the CI D does not include the other mean. Accordingly, the CI D can 
be used for inferences about the statistical significance of the between- 
subjects difference; and because the difference between sample means 
is what a researcher will typically be interested in, CI D is preferable to 
CL. in most circumstances. 

M 

To conclude, CI D and CI M are intimately related to classical f-tests 
and allow for a straightforward interpretation: A standard f-test is sig- 
nificant if, and only if, the 95% CI does not include the value in ques- 
tion. For the CI D , this value is the second sample mean, for the CI M , 
this is any fixed parameter value. 

Within-subjects designs: 
Two paired samples 

For within-subjects designs, matters seem to be more complicated at 
first sight. In fact, Cumming and Finch (2005) recommended: "For 
paired data, interpret the mean of the differences and error bars for 
this mean. In general, beware of bars on separate means for a repeated- 
measure independent variable: They are irrelevant for inferences about 
differences" (p. 180). 

Caution is indeed necessary in this situation, because the often- 
used CI M obviously is unrelated to the within-subjects difference. Yet, 
several CIs for within-subjects designs have been proposed in the 
last decades (Cousineau, 2005; Jarmasz & Hollands, 2009; Loftus & 
Masson, 1994) with the most prevalent variant being the one of Loftus 
and Masson. These CIs are typically derived from the error term of the 
repeated-measures ANOVA and we will come back to these methods 
in the section What to do with more complex designs. For comparing 



The CI pD is thus equivalent to the confidence interval of the dif- 
ference between both paired means and corresponds directly to a 
paired-samples f-test. When plotted around the actual sample means, 
this f-test is significant if one mean is not part of the CI pD around the 
other mean; consequently CI pD is a direct within-subjects counterpart 
of the CI D for independent samples. Taken together, we suggest that the 
CI pD can be computed more easily and seems to be more closely related 
to interpreting the difference between two dependent means than any 
other solution. 

AFFECTION, PHEROMONES, AND CIS: 
A HANDS-ON EXAMPLE 

Table 2 shows the data of a fictitious - and rather arbitrary - study in 
which participants indicated their affection for the experimenter on a 
rating scale. This scale ranges from -10 {dislike) to 0 (neutral) to 10 (af- 
fection). Condition 1 is a control condition without any specific treat- 
ment whereas the experimenter used a healthy dose of pheromones in 
Condition 2. 

Different CIs are possible in this situation, depending on the actual 
design and the Cfs intended meaning. The most important question, 
of course, relates to the design: Different CIs are appropriate depending 
on whether the data result from a between-subjects design (different 
participants contributed to Condition 1 and Condition 2, respectively) 
or a within-subjects design (the data in each row belong to a single par- 
ticipant). The three different CIs described above are plotted in Figure 1 
and will be discussed in the following (see Appendix A for a short 
tutorial on how to compute these intervals with common computer 
programs). 

Between-subjects: 

CIs for individual means 

Confidence intervals for individual means can be computed easily 
based on the two standard deviations in Table 2. Accordingly, the two 
SEs amount to the following values (Equation 8): 



_ s, 1.92 s 2 1.30 



(8) 
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Between-Subjects 



Raw Data 



Within-Subjects 



a: CI M 
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FIGURE 1. 

Three different confidence intervals (C/s) for two sample means. The raw data are plotted in the center of the figure; dots represent 
individual data points (five observations per mean; see also Table 2). Panels A and B show C/s that are approriate for between-subjects 
designs; Panel C shows a C/that is appropriate for within-subjects designs (pairs of values are indicated by dashed lines in the raw data). 
Panel A. C/s for individual means (CI 'J rely on the standard error (SE) of the corresponding mean. The C/ M indicates whether this mean 
is significantly different from any given (fixed) value. They do not inform about the statistical significance of the difference between 
the means. Panel B. C/for the difference between the means (C/ D ).The means are significantly different (as judged by f-tests for inde- 
pendent samples) if one mean is not included in the C/ D around the other mean. Panel C. Within-subjects CI, constructed from the 
paired difference scores (C/ pD ).Two means from paired samples are significantly different (as judged by a paired-samples f-test) if one 
mean is not included in the C/ Dn around the other mean. 



■ 

TABLE 2. 

Example Data 



Reported affection for the experimenter 
as indicated on a rating scale (-10 to 10). 





Condition 1 


Condition 2 


Observation 


(control) 


(pheromones) 


1 


7 


8 


2 


3 


5 


3 


4 


6 


4 


2 


5 


5 


5 


7 


M 


4.20 


6.20 


s 


1.92 


1.30 



Note. Condition 1 is a control condition without any specific treatment, 
whereas the experimenter had used a dose of pheromones in Condition 2. 
In the following equations, we will use the indices 1 and 2 to refer to the 
control condition and the pheromone condition, respectively. 



Both SEs are then multiplied with the critical t- value of t x = 
2.78 to compute the respective CI (see Equations 9 and 10). These two 
CI U are plotted in Panel A of Figure 1. 

95% C/ M(1) = 4.20 ± 0.86 • 2.78 = 4.20 ± 2.39 (9) 

95% CI M{2) = 6.20 ± 0.58 • 2.78 = 6.20 ± 1 .62 (10) 

The two CI U indicate that the mean affection ratings are signifi- 
cantly different from zero for both conditions, that is, participants were 
positively biased toward the experimenter even when not affected by 
pheromones. Importantly, however, the CI U are not informative for the 
difference between the affection rating of the control participants and 
the participants who were exposed to pheromones. 

Between-subjects: 
CIs for the difference 

The SE D is equivalent to the SE that is used for the f-test for inde- 
pendent samples (Equation 11): 
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n 1 + n 2 - 2 V n. 



(11) 



_ f (5-1)-1.92 2 + (5-1)-1.30 2 O 
V 5+5-2 i5 5 

Because the difference between both (independent) means in- 
volves subjects of both groups, the CJ D is computed by multiplying the 
SE D with the appropriate critical t- value of £ 5 + 5 _ 2 . 0975 = 231 ( see Equa- 
tion 12): 



95% C/ n = M /9 + 1 .04 • 2.31 = ML„ ± 2.40 



(12) 



This CI D is plotted around each mean in Panel B of Figure 1. 
The mean rating of the control participants is clearly included in 
the CJ D around the mean of the participants who were exposed to 
pheromones - both values are thus not significantly different as judged 
by a f-test for independent samples. 

Within-subjects: 
CI for the difference 

In contrast to the previous CJs, we now assume the data in Table 2 
to result from a within-subjects design: Participants were first tested 
in the control condition and then exposed to the pheromones (or 
vice versa). Accordingly, the two ratings in each row of Table 2 are 
now assumed to belong to the same individual. The now appro- 
priate CJ pD is based on the pairwise difference scores for the data 
in Table 2 (Condition 2 - Condition 1). These differences scores 
are 1, 2, 2, 3, and 2 for the five participants and their standard 
deviation is s d = 0.71. The corresponding SE pD amounts to (see Equa- 
tion 13): 



_ s d _0-71_ Q32 



With a critical f-value of t 



(13) 



2.78, we can now compute 



the CJ pD (Equation 14): 



95% CI PD = M V2 ± 0.32 • 2.78 = M V2 ± 0.88 



(14) 



The CJ pD is plotted around each mean in Panel C of Figu- 
re 1. The mean of the control condition is clearly not included in 
the CJ pD around the mean of the pheromones condition. This is 
equivalent to a significant effect as revealed by a paired-samples 
f-test. 

DECIDING WHAT TO PLOT 

As we have seen in the above example, different and equally possible 
SEs and CJs for a given situation can vary substantially and do convey 
different information. On closer inspection, the question which one to 
plot boils down to the question whether the difference between the two 
means is of major interest or not. 



If the difference is indeed of interest, we suggest that each mean is 
best accompanied by the CI of the difference that is appropriate for the 
employed design (i.e., either CJ D or CJ pD ). As noted above, these in- 
tervals allow direct inferences about the difference and have also been 
labelled inferential CJs for this reason (Tryon, 2001). In addition to 
plotting these CJs, it is of course equally important to describe what is 
plotted. Here, a typical description to be used in a figure caption would 
be "Error bars represent the XY% confidence interval of the difference". 
Alternatively, a concise description is also possible with the nomencla- 
ture suggested in this article that can be used to specify the plotted CI 
or SE on the axis of a graph (e.g., "RT ± SE m " or "RT and CJ pD " for a 
within-subjects design using response time as dependent variable). 

An additional option to this approach can be used if it is only the 
difference that counts whereas the actual means are not of interest. In 
this case, it is also possible to plot only the difference itself, accompa- 
nied by the corresponding CI (i.e., CJ D or CJ pD ). 
If the difference between the two means is not of major interest, how- 
ever, we suggest to plot the CJ M or SE U for each individual mean. Here, 
a typical description to be used in a figure caption would be "Error 
bars represent the XY% confidence interval of the individual means" 
or, to use the suggested nomenclature, a similar statement on the axis 
of a graph (e.g., "RT ± SE M " or "RT and CJ M "). These error bars inform 
about the homogeneity of variance across different samples or condi- 
tions and - even though they cannot be used for inferences about the 
difference between two means - they provide information about the 
difference of each mean from a fixed parameter. 

WHAT TO DO WITH MORE COMPLEX 
DESIGNS? 

The framework described in the preceding sections provides a straight- 
forward and intuitive approach to CJs for means from two conditions 
for both, between- and within-subjects designs. These CJs can be 
mapped directly to the different f-tests in classical hypothesis testing 
and, as mentioned above, they also rely on the same statistical assump- 
tions as the corresponding test. The described method of plotting the 
appropriate CI for the difference - CJ D or CJ pD , respectively - can also 
be applied to more complex designs given that specific pairwise com- 
parisons are crucial for the research question at hand (Franz & Loftus, 
2012). If applicable, this method might indeed be the easiest and thus 
favorable strategy. 

Still, this approach has obvious limitations regarding complex stu- 
dies which include numerous conditions. In such factorial designs, CJs 
are typically constructed from the error term of the ANOVA omnibus 
test. For between-subjects designs, appropriate methods are described 
comprehensively in several publications (e.g., Keppel & Wickens, 
2004; cf. also Estes, 1997). As noted above, different methods have 
been proposed also for factorial within-subjects designs (Cousineau, 
2005; Jarmasz & Hollands, 2009; Loftus & Masson, 1994) with the most 
prevalent variant being the one of Loftus and Masson (1994; see also 
Baguley, 2012; Bakeman & McArthur, 1996; Masson & Loftus, 2003; 
Hollands & Jarmasz, 2010; Tryon, 2001). Using these methods, howev- 
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er, requires a precise understanding of what these CIs reveal about the 
data. For instance, within-subject CIs according to Loftus and Masson 
(1994) are not directly equivalent to t- tests for paired samples but have 
to be multiplied by a fixed factor to allow for inferences about possibly 
significant effects (i.e., in the case of two groups/conditions: CJ pD = V2 
x loftus & MassoJ • Excellent examples on how to compute and interpret 
these CIs can be found in the corresponding articles. 

CONCLUDING REMARKS 

In the preceding sections, we have summarized three approaches to 
CIs for one of the most common designs in psychological research, that 
is, the comparison of two sample means. Clearly, different CIs need to 
be computed for between- and within-subjects designs (cf. Blouin & 
Riopelle, 2005; Cumming & Finch, 2005; Estes, 1997; Loftus & Masson, 
1994) and the particular CI used in a plot needs to be specified. To 
this end, we suggested an easy nomenclature for three different CIs to 
facilitate communication about what exactly a given CI represents (see 
Table 1). Furthermore, we argue that CIs for the difference between 
two means (CJ D and CJ pD ) are most informative in the majority of 
cases, because they can be interpreted intuitively. These CIs provide a 
straightforward approach to the described setting; more complex de- 
signs of course call for different approaches to CIs which can be found 
in a variety of recent articles. 

FOOTNOTES 

1 In the rare case of two equally sized samples with numerically 
identical standard deviations, the is informative also for the dif- 
ference between the means. Here, it is directly related to SE with 
SE D = V2 x SE M . If sample sizes or standard deviations are (even slight- 
ly) dissimilar, however, this relation is not valid. It should also be noted 
that this relation is only valid for SEs but not for the corresponding CIs: 
The coefficient of the CI U has n - 1 degrees of freedom (df) whereas the 
coefficient of the CI D has (n l + n 2 - 2)df. 
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APPENDIX A 

In the following, we show how the different CIs can be computed 
by common software packages, such as SPSS, MS Excel, and R. 

SPSS 

CIs for individual means (CJ M ) can be computed with the Explore 
command: 

Analyze > Descriptive Statistics > Explore 

Here, the menu Statistics allows to set the a level (default: 5%). 
Alternatively, the CI U is also contained in the output of the one-sample 
f-test in a section labelled 95% Confidence Interval of the Difference 
(with the a level being set in the Options menu). Both ways of comput- 
ing the CI result in a CI that is specified via lower and upper boundaries 
which can be easily transformed to the the notation that is used in this 
article (see Equation Al): 



CL=M 



, Upper -Lower 
= M±— — 



(Al) 



CIs for the difference between independent means (CJ D ) and CIs for 
the difference between paired means (CJ pD ) can be obtained by using 
the same formula on the output of the f-test for independent-samples 
and the paired-samples f-test, respectively These outputs also contain 
the corresponding values for SE D or S£ pD . 

Microsoft Excel 

Computing CIs for individual means (CJ M ) in MS Excel requires seve- 
ral quick steps. First, the standard deviation s is computed with STDEV 
function. Dividing this value by Vn returns the SE M . This can be done 
by using SQRT(n) or by computing n with the COUNT function. 

Finally, the SE M is multiplied with the coefficient taken from the t- 
distribution which can be accessed via the TINV function. Importantly, 





A 


B C 


1 


7 


s] =A1-B1 


2 


3 


5 




4 


6 


4 


2 


5 


5 


5 


~7]~ 


6 










B 


. — c — . 


D 


1 


3 


-1 




2 


5 


-2 




3 


6 


-2 




4 


5 


-3 




5 


7 


-2 








=STDEV(C1:C5)/SQRT(5) 


7 







FIGURE Al. 

Computing the 5E pD in MS Excel. Upper panel: Pairwise differences 
are computed for each case. Lower panel: The standard deviation 
of these difference scores is divided by Vfi to obtain the 5E pD . 



TINV is inherently two -tailed and takes the intended a level as input. 
Thus, calling TINV(0.05, n-l) will result in the correct value for t 975 . 

The CI for the difference of two independent means, CI D , is com- 
puted similarly with two changes. Most importantly, the SE D is com- 
puted using the corresponding formula in the section Between-subjects 
designs: Two independent samples (using SQRT for the square root and 
" A 2" to denote exponents). Furthermore, the critical t- value has to be 
requested with the correct number of dfs via 7TNV(0.05, n l + n 2 - 2). 

In contrast to CI U and CI D , the CI for the difference between paired 
means, CJ pD , requires an additional first step. Assuming that each 
condition is entered in a separate column, one first needs to compute 
pairwise differences (Figure Al). The estimated standard deviation of 
the difference scores s d can now be computed via the STDEV function. 
Dividing this value by returns the SE m . This can again be done by 
using SQRT(n). CJ pD is then computed by multiplying the SE pD with the 
coefficient computed by TINV(0.05, n-l) for t n _ 1-0 . 



Confidence intervals are included in the output of the function t.test. 
As in SPSS, CIs are typically given in terms of lower and upper bounda- 
ries. These values can be accessed directly to arrive at the notation that 
is used in this article: 



Clu 



, Upper -Lower 

-M±-^- 
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(A2) 



Accessing the boundaries works similarly for all f-tests and we will 
demonstrate the general procedure for the one-sample f-test and the 
corresponding CI M . First, we enter the data of Condition 1 as a vector 
and compute the one-sample f-test via t.test. The output is stored in the 
new variable result: 

> condl <- c(7,3,4,2,5) 

> result <- t.test(condl) 

The boundaries can now be addressed by result$conf.int which re- 
turns a vector containing both values. The length of an individual error 
bar can now be computed in the following way: 

> (max(result$conf.int) - min(result$conf.int))/2 [I] 2.388388 
Alternatively, R allows for a manual computation of SEs and CIs 

just as in MS Excel. We demonstrate this procedure for the CJ pD . For 
the sake of simplicity, we assume the data to be coded in vectors instead 
of being variables in a data frame. 

> condl <- c(7,3,4,2,5) > cond2 <- c(8,5,6,5,7) 

We then compute a difference vector and its length (i.e., the number 
of participants): 

> diff <- condl -cond2 >n <- length(diff) 

Then, the standard deviation of these difference scores is divided 
by and multiplied by t n x . 0 975 . The latter coefficient is computed via 
the function qt. 

> errorbar <- sd(diff) I sqrt(n) * qt(0.975,n-l ) 

The resulting error bar is used to compute upper and lower 
boundaries of the CJ pD for both condition means: 

> cil <- c(mean(condl) - errorbar, mean(condl) + errorbar) 

> ci2 <- c(mean(cond2) - errorbar, mean(cond2) + errorbar) 
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